
Substitute 47113.txt 
SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: Therion, Corporation 

(ii) TITLE OF THE INVENTION: Recombinant Pox Virus For 

Immunization Against MUCl Tumor-Associated Antigen 

(iii) NUMBER OF SEQUENCES: 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON PEABODY LLP 

(B) STREET: 100 Summer street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2131 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for wi ndows version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 10/057,136 

(B) FILING DATE: 25-MN-2002 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/366,670 

(B) FILING DATE: 03-AUG-1999 

(C) CLASSIFICATION: 

(viii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US98/03693 

(B) FILING DATE: 24-FEB-1998 

(C) CLASSIFICATION: 

(ix) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/038,253 

(B) FILING DATE: 24-FEB-1997 

(x) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Eisenstein, Ronald I 

(B) REGISTRATION NUMBER: 30,628 

(C) REFERENCE/DOCKET NUMBER: 700953-047113-C2-RCE 

(xi) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-345-6054 

(B) TELEFAX: 617-345-1300 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID N0:1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE description: SEQ ID N0:1: 



Gly 

1 

Arg 



ser Thr Ala Pro Pro Ala His Gly val Thr Ser Ala Pro Asp Thr 
5 10 15 



Pro Ala Pro 
20 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) topology: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GGCTCCACCG CCCCCCCAGC CCACGGTGTC ACCTCGGCCC CGGACACCAG GCCGGCCCCG 60 
(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Pro Asp Thr Arg Pro Ala Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGCAGTACTG CACCACCGGC ACATGGCGTA ACATCAGCAC CTGATACAAG ACCTGCACCT 60 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGATCCACCG CGCCGCCTGC GCACGGAGTG ACGTCGGCGC CCGACACGCG CCCCGCTCCC 60 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGTCAACAG CTCCTCCCGC TCATGGGGTT ACTTCTGCTC CAGATACTCG CCCAGCTCCA 60 
(2) INFORIWIATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGTTCGACGG CCCCCCCTGC TCACGGTGTA ACATCCGCCC CGGATACCAG ACCGGCCCCT 60 
(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
(c) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGCAGCACCG CACCGCCCGC ACACGGGGTC ACAAGCGCGC CAGACACTCG ACCTGCGCCA 60 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
(c) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE description: SEQ ID NO: 9: 
GGAAGTACCG CTCCACCTGC ACACGGGGTC ACAAGCGCGC CAGACACTCG ACCTGCGCCA 60 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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GGGTCGACTG CCCCTCCGGC GCATGGTGTG ACCTCAGCTC CTGACACAAG GCCAGCCCCA 60 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
(c) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
GGTTCAACGG CACCTCCAGC ACACGGAGTC ACGTCTGCAC CCGACACCCG TCCAGCTCCG 60 
(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGTAGTACAG CGCCACCCGC ACATGGCGTC ACGAGCGCTC CGGATACGAG ACCGGCGCCT 60 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGCTCCACCG CACCCCCAGC CCACGGTGTC ACCTCGGCCC CGGACACCAG GCGGGCCCCG 60 
GGCTCCACCC CGGCCCCG 78 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE description: SEQ ID NO: 14: 
GGCTCCACCG CCCCCCCAGC CCATGGTGTC ACCTCGGCCC CGGACAACAG GCCCGCCTTG 60 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
(c) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGCTCCACCG CCCCTCCAGT CCACAATGTC ACCTCGGCC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Gly Ser Thr Ala Pro Pro Ala His Gly val Thr Ser Ala Pro Asp Thr 

1 5 10 15 

Arg Arg Ala Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(c) STRANDEDNESS: single 
(D) topology: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gly ser Thr Ala Pro Pro Ala His Gly val Thr Ser Ala Pro Asp Asn 

1 5 10 15 

Arg Pro Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO I 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Gly ser Thr Ala Pro Pro val His Asn val Thr ser Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .1524 
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(D) OTHER INFORMATION: 



txt 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ATG ACA CCG GGC ACC CAG TCT CCT TTC TTC CTG CTG CTG CTC CTC ACA 48 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 10 15 

GTG CTT ACA GCT ACC ACA GCC CCT AAA CCC GCA ACA GTT GTT ACG GGT 96 

val Leu Thr Ala Thr Thr Ala Pro Lys Pro Ala Thr val val Thr Gly 
20 25 30 

TCT GGT CAT GCA AGC TCT ACC CCA GGT GGA GAA AAG GAG ACT TCG GCT 144 

Ser Gly His Ala Ser ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala 
35 40 45 

ACC CAG AGA AGT TCA GTG CCC AGC TCT ACT GAG AAG AAT GCT GTG AGT 192 

Thr Gin Arg ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala val Ser 
50 55 60 

ATG ACA AGC TTG ATA TCG AAT TCC GGT GTC CGG GGC TCC ACC GCC CCC 240 

Met Thr ser Leu lie Ser Asn ser Gly val Arg Gly ser Thr Ala Pro 
65 70 75 80 

CCA GCC CAC GGT GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC CCG GGC 288 

Pro Ala His Gly val Thr ser Ala Pro Asp Thr Arg Pro Ala Pro Gly 

85 90 95 

TCC ACC GCC CCC CCA GCC CAC GGT GTC ACC TCG GCC CCG GAC ACC AGG 336 

ser Thr Ala Pro Pro Ala His Gly val Thr Ser Ala Pro Asp Thr Arg 
100 105 110 

CCG GCC CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT GTC ACC TCG GCC 384 

Pro Ala pro Gly Ser Thr Ala Pro Pro Ala His Gly val Thr ser Ala 
115 120 125 

CCG GAC ACC AGG CCG GCC CCG GGC TCC ACC GCA CCC CCA GCC CAC GGT 432 

pro ASP Thr Arg Pro Ala Pro Gly ser Thr Ala Pro Pro Ala His Gly 
130 135 140 

GTC ACC TCG GCC CCG GAC ACC AGG CGG GCC CCG GGC TCC ACC CCG GCC 480 

val Thr ser Ala Pro Asp Thr Arg Arg Ala Pro Gly ser Thr Pro Ala 
145 150 155 160 

CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT GTC ACC TCG GCC CCG GAC 528 
Pro Gly ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp 

165 170 175 

ACC AGG CCG GCC CCG GGC TCC ACC GCC CCC CCA GCC CAT GGT GTC ACC 576 

Thr Arg Pro Ala Pro Gly ser Thr Ala Pro Pro Ala His Gly val Thr 
180 185 190 

TCG GCC CCG GAC AAC AGG CCC GCC TTG GGC TCC ACC GCC CCT CCA GTC 624 
Ser Ala Pro Asp Asn Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val 
195 200 205 

CAC AAT GTC ACC TCG GCC TCA GGC TCT GCA TCA GGC TCA GCT TCT ACT 672 

His Asn val Thr ser Ala Ser Gly Ser Ala Ser Gly ser Ala Ser Thr 
210 215 220 

CTG GTG CAC AAC GGC ACC TCT GCC AGG GCT ACC ACA ACC CCA GCC AGC 720 

Leu val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr pro Ala Ser 
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225 230 235 240 

AAG AGC ACT CCA TTC TCA ATT CCC AGC CAC CAC TCT GAT ACT CCT ACC 768 

Lys Ser Thr Pro Phe ser lie Pro Ser His His Ser Asp Thr Pro Thr 

245 250 255 

ACC CTT GCC AGC CAT AGC ACC AAG ACT GAT GCC AGT AGC ACT CAC CAT 816 

Thr Leu Ala Ser His ser Thr Lys Thr Asp Ala ser Ser Thr His His 
260 265 270 

AGC ACG GTA CCT CCT CTC ACC TCC TCC AAT CAC AGC ACT TCT CCC CAG 864 
ser Thr val Pro Pro Leu Thr Ser ser Asn His Ser Thr ser Pro Gin 
275 280 285 

TTG TCT ACT GGG GTC TCT TTC TTT TTC CTG TCT TTT CAC ATT TCA AAC 912 

Leu Ser Thr Gly val ser Phe Phe Phe Leu Ser Phe His lie Ser Asn 
290 295 300 

CTC CAG TTT CCT TCC TCT CTC GAA GAT CCC AGC ACC GAC TAC TAC CAA 960 

Leu Gin Phe Pro ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gin 
305 310 315 320 

GAG CTG CAG AGA GAC ATT TCT CAA ATG TTT TTG CAG ATT TAT AAA CAA 1008 

Glu Leu Gin Arg Asp lie ser Gin Met Phe Leu Gin lie Tyr Lys Gin 

325 330 335 

GGG GGT TTT CTG GGC CTC TCC AAT ATT AAG TTC AGG CCA GGA TCT GTG 1056 

Gly Gly Phe Leu Gly Leu Ser Asn lie Lys Phe Arg Pro Gly Ser Val 
340 345 350 

CTG GTA CAA TTG ACT CTG GCC TTC CGA GAA GGT ACC ATC AAT GTC CAC 1104 

Leu val Gin Leu Thr Leu Ala Phe Arg Glu Gly Thr lie Asn val His 
355 360 365 

GAC GTG GAG ACA CAG TTC AAT CAG TAT AAA ACG GAA GCA GCC TCT CGA 1152 

Asp Val Glu Thr Gin Phe Asn Gin Tyr Lys Thr Glu Ala Ala Ser Arg 
370 375 380 

TAT AAC CTG ACG ATC CCA GAC GTC AGC GTG AGT GAT GTG CCA TTT CCT 1200 

Tyr Asn Leu Thr lie Pro Asp val ser val Ser Asp val Pro Phe Pro 
385 390 395 400 

TTC TCT GCC CAG TCT GGG GCT GGG GTG CCA GGC TGG GGC ATC GCG CTG 1248 

Phe Ser Ala Gin Ser Gly Ala Gly val Pro Gly Trp Gly lie Ala Leu 

405 410 415 

CTC CTG CTG GTC TGT GTT CTG GTT GCG CTG GCC ATT GTC TAT CTC ATT 1296 

Leu Leu Leu val cys val Leu val Ala Leu Ala lie val Tyr Leu lie 
420 425 430 

GCC TTG GCT GTC TGT CAG TGC CGC CGA AAG AAC TAC GGG CAG CTG GAC 1344 

Ala Leu Ala val Cys Gin Cys Arg Arg Lys Asn Tyr Gly Gin Leu Asp 
435 440 445 

ATC TTT CCA GCC CGG GAT ACC TAC CAT CCT ATG AGC GAG TAC CCC ACC 1392 
lie Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr 
450 455 460 

TAC CAC ACC CAT GGG CGC TAT GTC CCC CCT AGC AGT ACC GAT CGT AGC 1440 
Tyr His Thr His Gly Arg Tyr val Pro Pro Ser Ser Thr Asp Arg Ser 
465 470 475 480 

CCC TAT GAG AAG GTT TCT GCA GGT AAT GGT GGC AGC AGC CTC TCT TAC 1488 
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Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu ser Tyr 

485 490 495 

ACA AAC CCA GCA GTG GCA GCC ACT TCT GCC AAC TTG TAG 1527 

Thr Asn Pro Ala Val Ala Ala Thr Ser Ala Asn Leu 
500 505 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE 


DESCRIPTION: 


SEQ ID 


Met Thr 


Pro 


Gly 


Thr 


Gin 


Ser 


Pro 


Phe 


Phe 


1 




5 










10 


val Leu 


Thr 


Ala 


Thr 


Thr 


Ala 


Pro 


Lys 


Pro 






20 










25 


G1y 


Ser Gly 


His 


Ala 


Ser 


ser 


Thr 


Pro 


Gly 


35 










40 




Thr 


Thr Gin 


Arg 


Ser 


Ser 


val 


Pro 


Ser 


ser 


50 








55 




Gly 


val 


Met Thr 


ser 


Leu 


lie 


Ser 


Asn 


Ser 


65 








70 




Ala 






Pro Ala 


His 


Gly 


val 


■ 

Thr 


Ser 


Pro 


Asp 






85 










90 


Ser Thr 


Ala 


Pro 


pro 


Ala 


His 


Gly 


Val 


Thr 






100 








105 




Pro Ala 


Pro 


Gly 


Ser 


Thr 


Ala 


Pro 


Pro 


Ala 




115 








120 




Thr 


Pro Asp 


Thr 


Arg 


Pro 


Ala 


Pro 


Gly 


ser 


130 








135 






Ala 


val Thr 


Ser 


Ala 


Pro 


Asp 


Thr 


Arg 


Arg 


145 








150 






His 


Gly 


Pro Gly 


Ser 


Thr 


Ala 


Pro 


Pro 


Ala 






165 










170 


Thr Arg 


Pro 


Ala 


Pro 


Gly 


ser 


Thr 


Ala 


Pro 




180 






Ala 


185 


Gly 


Ser Ala 


pro 


Asp 


Asn 


Arg 


Pro 


Leu 




195 






200 




Ala 


His Asn 


val 


Thr 


Ser 


Ala 


ser 


Gly 


ser 


210 










215 






Ala 


Leu Val 


His 


Asn 


Gly 


Thr 


Ser 


Ala 


Arg 


225 






230 








His 


Lys Ser 


Thr 


Pro 


Phe 


ser 


He 


Pro 


ser 






245 










250 


Thr Leu 


Ala 


ser 


His 


Ser 


Thr 


Lys 


Thr 


Asp 






260 










265 




Ser Thr 


val 


Pro 


Pro 


Leu 


Thr 


Ser 


ser 


Asn 




275 










280 






Leu Ser 


Thr 


Gly 


val 


ser 


Phe 


Phe 


Phe 


Leu 


290 








295 


Glu 






Leu Gin 


Phe 


Pro 


ser 


ser 


Leu 


Asp 


Pro 


305 








310 




Gin 




Phe 


Glu Leu 


Gin 


Arg 


Asp 


He 


Ser 


Met 






325 










330 



45 

Glu Lys ASI 
60 

Arg Gly Sei 
75 



125 

Ala Pro Pro 

140 
Pro Gly Ser 
155 



205 

Ser Gly Ser 

220 
Thr Thr Thr 
235 



285 

ser Phe His 

300 
Ser Thr Asp 
315 

Leu Gin lie 
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Leu 


Leu 


Thr 




15 




val 


Thr 


Gly 


30 




Thr 


Ser 


Ala 


A 1 a 


va 1 


C A r* 

^er 


Tnr 


A 1 a 


rro 






80 


Ala 


Pro 


Gly 




95 


Asp 


Thr 


Arg 


110 






Thr 


ser 


Ala 


Ala 


His 


Gly 


Thr 


Pro 


Ala 






160 


Ala 


Pro 


Asp 




175 




Gly 


Val 


Thr 


190 






Pro 


Pro 


val 


Ala 


Ser 


Thr 


Pro 


Ala 


Ser 






240 


Thr 


Pro 


Thr 




255 




Thr 


His 


His 


270 






Ser 


Pro 


Gin 


He 


Ser 


Asn 


Tyr 


Tyr 


Gin 




320 


Tyr 


Lys 


Gin 


335 
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Gly Gly Phe Leu Gly Leu Ser Asn lie Lys Phe Arg Pro Gly Ser val 

340 345 350 

Leu val Gin Leu Thr Leu Ala Phe Arg Glu Gly Thr lie Asn val His 

355 360 365 

Asp val Glu Thr Gin Phe Asn Gin Tyr Lys Thr Glu Ala Ala Ser Arg 

370 375 380 

Tyr Asn Leu Thr lie Pro Asp val Ser val Ser Asp Val Pro Phe Pro 
385 390 395 400 

Phe ser Ala Gin Ser Gly Ala Gly val Pro Gly Trp Gly lie Ala Leu 

405 410 415 

Leu Leu Leu val Cys val Leu Val Ala Leu Ala lie val Tyr Leu lie 

420 425 430 

Ala Leu Ala val cys Gin Cys Arg Arg Lys Asn Tyr Gly Gin Leu Asp 

435 440 445 

He Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr 

450 455 460 

Tyr His Thr His Gly Arg Tyr val Pro Pro Ser Ser Thr Asp Arg Ser 
465 470 475 480 

Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr 

485 490 495 

Thr Asn Pro Ala val Ala Ala Thr Ser Ala Asn Leu 
500 505 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Gly ser Thr Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly val Thr 
15 10 15 

Ser Ala Phe Asp Thr Arg Pro Ala Pro 
20 25 
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